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Abstract 

Convex algebraic geometry concerns the interplay between opti- 
mization theory and real algebraic geometry. Its objects of study 
include convex semialgebraic sets that arise in semidefinite program- 
ming and from sums of squares. This article compares three notions of 
duality that are relevant in these contexts: duality of convex bodies, 
duality of projective varieties, and the Karush-Kuhn- Tucker condi- 
tions derived from Lagrange duality. We show that the optimal value 
of a polynomial program is an algebraic function whose minimal poly- 
nomial is expressed by the hypersurface projectively dual to the con- 
straint set. We give an exposition of recent results on the boundary 
structure of the convex hull of a compact variety, we contrast this to 
Lasserre's representation as a spectrahedral shadow, and we explore 
the geometric underpinnings of semidefinite programming duality. 



1 Introduction 

Dualities are ubiquitous in mathematics and its applications. This article 
compares several notions of duality that are relevant for the interplay between 
convexity, optimization, and algebraic geometry. It is primarily expository, 
and is intended for a diverse audience, ranging from graduate students in 
mathematics to practitioners of optimization who are based in engineering. 

Duality for vector spaces lies at the heart of linear algebra and functional 
analysis. Duality in convex geometry is an involution on the set of convex 
bodies: for instance, it maps the cube to the octahedron and vice versa 
(Figured]). Duality in optimization, known as Lagrange duality, plays a key 
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Figure 1: The cube is dual to the octahedron. 



role in designing efficient algorithms for the solution of various optimization 
problems. In projective geometry, points are dual to hyperplanes, and this 
leads to a natural notion of projective duality for algebraic varieties. 

Our aim here is to explore these dualities and their interconnections in the 
context of polynomial optimization and semidefmite programming. Towards 
the end of the Introduction, we shall discuss the context and organization of 
this paper. At this point, however, we jump right in and present a concrete 
three-dimensional example that illustrates our perspective on these topics. 



1.1 How to Dualize a Pillow 

We consider the following symmetric matrix with three indeterminate entries: 
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This symmetric 4x4- matrix specifies a 3-dimensional compact convex body 
P = {(x,y,z)eR 3 | Q{x,y,z) h 0}. (1.2) 
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Figure 2: A 3-dimensional spectrahedron P and its dual convex body P A . 

The notation £ V 0" means that the matrix is positive semidefinite, i.e., all 
four eigenvalues are non-negative real numbers. Such a linear matrix inequal- 
ity always defines a closed convex set which is referred to as a spectrahedron. 

Our spectrahedron P looks like a pillow. It is shown on the left in Figure El 
The algebraic boundary of P is the surface specified by the determinant 

det(Q(x,y,z)) = x 2 {y - zf - 2x 2 - y 2 - z 2 + 1 = 0. 

The interior of P represents all matrices Q(x, y, z) whose four eigenvalues 
are positive. At all smooth points on the boundary of P, precisely one 
eigenvalue vanishes, and the rank of the matrix Q(x, y, z) drops from 4 to 3. 
However, the rank drops further to 2 at the four singular points 

(x,y,z) = -L(i,i,-i), -L(-i, -1,1), -L(i, -1,1), -L(-i, i,-i). 

We find these from a Grobner basis of the ideal of 3 x 3-minors of Q(x, y, z): 

{2a; 2 - 1, 2z 2 - 1, y + z}. 

The linear polynomial y + z in this Grobner basis defines the symmetry plane 
of the pillow P. The four corners form a square in that plane. Its edges are 
also edges of P. All other faces of P are exposed points. These come in two 
families, called protrusions, one above the plane y + z = and one below it. 
Like all convex bodies, our pillow P has an associated dual convex body 

P A = { (a, b, c) e M 3 | ax + by + cz < 1 for all (x, y,z) G P } , (1.3) 
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consisting of all linear forms that evaluate to at most one on P. Our notation 
P A is chosen to be consistent with that in Ziegler's text book §2-3]. 

The dual pillow P A is shown on the right in Figure |2j Note the associa- 
tion of faces under duality. The pillow P has four 1-dimensional faces, four 
singular O-dimensional faces, and two smooth families of O-dimensional faces. 
The corresponding dual faces of P A have dimensions 0, 2 and respectively. 

Semidefinite programming is the computational problem of minimizing a 
linear function over a spectrahedron. For our pillow P, this takes the form 

p*(a, b, c) = Maximize ax + by + cz 

{x,y,z)m 3 n 4) 

subject to Q(x, y, z) >z 0. 

We regard this as a parametric optimization problem: we are interested in 
the optimal value and optimal solution of f ll.4p as a function of (a, b, c) el 3 . 
This function can be expressed in terms of the dual body P A as follows: 

p*(a,b,c) = Minimize A 



1 A (L5) 

subject to — ■ (a,b,c) G P . 

A 

We distinguish this formulation from the duality in semidefinite program- 
ming. The dual to fll.4p is the following program with 7 decision variables: 

d*(a, b, c) = Minimize u\ + u 4 + u 6 + u 7 

u-x —2uo—a\ 

(1.6) 
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Since (jl.4j) and f ll.6p are both strictly feasible, strong duality holds (U §5.2.3], 
i.e. the two programs attain the same optimal value: p*(a, b, c) = d*(a, b, c). 
Hence, problem ( II. 6p can be derived from (II. 5p . as we shall see in Section |5j 
We write M(u;a,b,c) for the 4x4- matrix in ( II. 6p . The following equa- 
tions and inequalities, known as the Karush-Kuhn- Tucker conditions (KKT), 
are necessary and sufficient for any pair of optimal solutions: 

Q(x, y, z) ■ M(u; a, b, c) = 0, (complementary slackness) 
Q(x,y,z) y 0, 
M{u]a,b,c) y 0. 
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We relax the inequality constraints and consider the system of equations 

A = ax + by + cz and Q(x, y, z) ■ M(u; a, 6, c) = 0. 

This is a system of 11 equations. Using computer algebra, we eliminate the 
10 unknowns x,y, z,U\, . . . , 117. The result is a polynomial in a, b, c and A. Its 
factors, shown in f ll.7p - fll.8p . express the optimal value A* in terms of a, b, c. 

At the optimal solution, the product of the two 4x4-matrices Q(x,y,z) 
and M(u; a, b, c) is zero, and their respective ranks are either (3, 1) or (2, 2). 
In the former case the optimal value A* is one of the two solutions of 

(ft 2 + 2bc + c 2 ) • A 2 - a 2 b 2 - a 2 c 2 - 6 4 - 2b 2 c 2 - 26c 3 - c 4 - 26 3 c = 0. (1.7) 

In the latter case it comes from the four corners of the pillow, and it satisfies 

(2A 2 - a 2 + lab - b 2 + 26c - c 2 - 2ac) , , 

■ (2A 2 - a 2 - lab - b 2 + 26c - c 2 + 2ac) = 0. { } 

These two equations describe the algebraic boundary of the dual body P A . 
Namely, after setting A = 1, the irreducible polynomial in fll.7p describes the 
quartic surface that makes up the curved part of the boundary of P A , as seen 
in Figure [2j In addition, there are four planes spanned by flat 2-dimensional 
faces of P A . The product of the four corresponding affine-linear forms equals 
f ll.8p . Indeed, each of the two quadrics in f ll.8p factors into two linear factors. 
These two characterize the planes spanned by opposite 2-faces of P A . 

The two equations fll.7p and f ll.Bp also offer a first glimpse at the concept 
of projective duality in algebraic geometry. Namely, consider the surface in 
projective space P 3 defined by det(Q(x, y, z)) = after replacing the ones 
along the diagonals by a homogenization variable. Then f ll.7p is its dual 
surface in the dual projective space (P 3 )*. The surface fll.8p in (P 3 )* is dual 
to the 0-dimensional variety in P 3 cut out by the 3x3-minors of Q(x,y,z). 

The optimal value function of the optimization problem fll.4p is given by 
the algebraic surfaces dual to the boundary of P and its singular locus. We 
have seen two different ways of dualizing fll.4p : the dual optimization problem 
f ll.6p . and the optimization problem fll.5p on P A . These two formulations 
are related as follows. If we regard fll.6p as specifying a 10-dimensional 
spectrahedron, then the dual pillow P A is a projection of that spectrahedron: 

P A = {(a,6,c) G M 3 I 3u e R 7 : M(u;a,b,c) h and n 1 +n 4 +n 6 +n 7 = l}. 
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Linear projections of spectrahedra are called spectrahedral shadows. These 
objects play a prominent role in the interplay between semidefinite program- 
ming and convex algebraic geometry. The dual body to a spectrahedron is 
generally not a spectrahedron, but it is always a spectrahedral shadow. 

1.2 Context and Outline 

Duality is a central concept in convexity and convex optimization, and nu- 
merous authors have written about their connections and their interplay with 
other notions of duality and polarity. Relevent references include Barvinok's 
text book [TJ §4] and the survey by Luenberger [TJ5]. The latter focuses on 
dualities used in engineering, such as duality of vector spaces, polytopes, 
graphs, and control systems. The objective of this article is to revisit the 
theme of duality in the context of convex algebraic geometry. This emerging 
field aims to exploit algebraic structure in convex optimization problems, 
specifically in semidefinite programming and polynomial optimization. In 
algebraic geometry, there is a natural notion of projective duality, which 
associates to every algebraic variety a dual variety. One of our goals is to 
explore the meaning of projective duality for optimization theory. 

Our presentation is organized as follows. In Section [2] we cover preliminar- 
ies needed for the rest of the paper. Here the various dualities are carefully 
defined and their basic properties are illustrated by means of examples. In 
Section E] we derive the result that the optimal value function of a polynomial 
program is represented by the defining equation of the hypersurface projec- 
tively dual to the manifold describing the boundary of all feasible solutions. 
This highlights the important fact that the duality best known to algebraic 
geometers arises very naturally in convex optimization. Section H] concerns 
the convex hull of a compact algebraic variety in MJ 1 . We discuss recent work 
of Ranestad and Sturmfels [2U [25] on the hypersurfaces in the boundary of 
such a convex body, and we present several new examples and applications. 

In Section [5] we focus on semidefinite programming (SDP), and we offer 
a concise geometric introduction to SDP duality. This leads to the concept 
of algebraic degree of SDP [E1I22], or, geometrically, to projective duality for 
varieties defined by rank constraints on symmetric matrices of linear forms. 

A spectrahedral shadow is the image of a spectrahedron under a linear 
projection. Its dual body is a linear section of the dual body to the spectrahe- 
dron. In Section[H]we examine this situation in the context of sums-of-squares 
programming, and we discuss linear families of non-negative polynomials. 
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2 Ingredients 



In this section we review the mathematical preliminaries needed for the rest 
of the paper, we give precise definitions, and we fix more of the notation. We 
begin with the notion of duality for vector spaces and cones therein, then 
move on to convex bodies, polytopes, Lagrange duality in optimization, the 
KKT conditions, projective duality in algebraic geometry, and discriminants. 

2.1 Vector Spaces and Cones 

We fix an ordered field K. The primary example is the field of real numbers, 
K — R, but it makes much sense to also allow other fields, such as the 
rational numbers K = Q or the real Puiseux series K = IR{{e}}. For a 
finite dimensional A'- vector space V, the dual vector space is the set V* = 
Hom(V, K) of all linear forms on V. Let V and W be vector spaces and 
ip : V — > W a linear map. The dual map ip* : W* — > V* is the linear map 
defined by ip* (w) = w o <p> g V* for every w G W*. If we fix bases of V and 
W then ip is represented by a matrix A. The dual map </?* is represented, 
relative to the dual bases for W* and V*, by the transpose A 1 of the matrix A. 

A subset C C V is a cone if it is closed under multiplication with positive 
scalars. A cone C need not be convex, but its dual cone 

C* = {l eV* \Vx eC : l(x) > } (2.1) 

is always closed and convex in V*. If C is a convex cone then the second 
dual (C*)* is the closure of C . Thus, if C is a closed convex cone in V then 

(C*)* = C. (2.2) 

This important relationship is referred to as biduality. 

Every linear subspace L C V is also a cone. Its dual cone is the orthogonal 
complement of the subspace: 

L* = L 1 - = { I G V* | Vx G L : l(x) = } . 

The dual map to the inclusion L C V is the projection 717, : — > V* /L^. 
Given any cone C C V, the intersection C fl L is a cone in L. Its dual cone 
(C fl L)* is the projection of the cone C into V* / L^. More precisely, 

(cmy = c* + l ± m v*. 
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Now, it makes sense to consider this convex set modulo L . We thus obtain 

(CnL)* = tt l (C*) in V*/L L . (2.3) 

This identity shows that projection and intersections are dual operations. 

A subset F C C of a convex set C is a face if F is itself convex and 
contains any line segment L C C whose relative interior intersects F. We 
say that F is an exposed face if there exists a linear functional I that attains 
its minimum over C precisely at F. Clearly, an exposed face is a face, but the 
converse does not hold. For instance, the edges of the red triangle in Figure [6] 
are non-exposed faces of the 3-dimensional convex body shown there. 

An exposed face F of a cone C determines a face of the dual cone C* via 

F° = { I G C* | I attains its minimum over C at F } . 

The dimensions of the faces F of C and F° of C* satisfy the inequality 

dim(F) + dim(F°) < dim(V). (2.4) 

If C is a polyhedral cone then C* is also polyhedral. In that case, the number 
of faces F and F° is finite and equality holds in (12 .4p . On the other hand, 
most cones considered in this article are not polyhedral, they have infinitely 
many faces, and the inequality in (12 .4p is usually strict. For instance, the 
second order cone C = { (x, y, z) G M 3 : a/x 2 + y 2 < z} is self-dual, each 
proper face F of C is 1-dimensional, and the formula (12 .4p says 1 + 1 < 3. 

2.2 Convex Bodies and their Algebraic Boundary 

A convex body in V is a full- dimensional convex set that is closed and bounded. 
If C is a cone and z G int(C*) then C (1 {z = 1} is a convex body in the 
hyperplane { z = 1 } of V. In this manner, every pointed d- dimensional 
cone gives rise to a {d— l)-dimensional convex body, and vice versa. These 
transformations, known as homogenization and dehomogenization, respect 
faces and algebraic boundaries. They allow us to go back and fourth be- 
tween convex bodies and cones in the next higher dimension. For instance, 
the 3-dimensional body P in (11.21) corresponds to the cone in IR 4 we get by 
multiplying the constants 1 on the diagonal in (II. ip with a new variable. 

Let P be a full-dimensional convex body in V and assume that G int(P). 
Dehomogenizing the definition for cones, we obtain the dual convex body 

P A = { i e V* | Vx G P : £{x) < 1 } . (2.5) 
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This is derived from (12.1 j) using the identification /(x) = z — for z = 1. 
Just as in the case of convex cones, if P is closed then biduality holds: 




P. 



The definition (12 .5p makes sense for arbitrary subsets P of V . That is, P need 
not be convex or closed. A standard fact from convex analysis [261 Cor. 12.1.1 
and §14] says that the double dual is the closure of the convex hull: 



All convex bodies discussed in this article are semialgebraic, that is, they 
can be described by polynomial inequalities. We note that if P is semialge- 
braic then its dual body P A is also semialgebraic. This is a consequence of 
Tarski's theorem on quantifier elimination in real algebraic geometry (2j[3]. 

The algebraic boundary of a semialgebraic convex body P, denoted d a P, 
is the smallest algebraic variety that contains the boundary dP. In geometric 
language, d a P is the Zariski closure of dP. It is identified with the squarefree 
polynomial fp that vanishes on dP. Namely, d a P = V(fp) is the zero set of 
the polynomial fp. Note that fp is unique up to a multiplicative constant. 
Thus d a P is an algebraic hypersurface which contains the boundary dP. 

A polytope is the convex hull of a finite subset of V. If P is a polytope 
then so is its dual P A [29] . The boundary of P consists of finitely many facets 
P. These are the faces F = v ° dual to the vertices v of P A . The algebraic 
boundary d a P is the arrangement of hyperplanes spanned by the facets of 
P. Its defining polynomial fp is the product of the linear polynomials v — 1. 

Example 2.1. A polytope known to everyone is the three-dimensional cube 

P = conv{(±l,±l,±l)} = {-l<x,y,z<l}. 

Figure [T] illustrates the familiar fact that its dual polytope is the octahedron 

P A = {-\<a±b±c<\} = conv{±ei,±e 2 ,±e 3 }. 

Here e\ denotes the ith unit vector. The eight vertices of P correspond to 
the facets of P A , and the six facets of P correspond to the vertices of P A . 
The algebraic boundary of the cube is described by a degree 6 polynomial 




conv(PUO). 



d a P = V(( 



x 2 -l)(y 2 -l)(, 2 -l)). 
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f A 
I J 

Figure 3: The unit balls for the L4 norm and the L4/3 norm are dual. The 
curve on the left has degree 4, while its dual curve on the right has degree 12. 

The algebraic boundary of the octahedron is given by a degree 8 polynomial 
d a P A = V (Jj(l-a±j/±c)j](a±6±c+l)). 

Note that P and P A are the unit balls for the norms and Li on R 3 . □ 

Recall that the L p -norm on IR n is defined by \\x\\ p = (5^™ =1 |£j| p ) 1//p for 
x E M n . The dual norm to the L„-norm is the Z^-norm for - + - — 1, that is, 

p y p q ' ' 

\\y\\ q = sup{(y,x) I x E M n , \\x\\ p < 1}. 

Geometrically, the unit balls for these norms are dual as convex bodies. 

Example 2.2. Consider the case n = 2 and p = 4. Here the unit ball equals 

P = { (2, y) G M 2 : x A + y A < 1 }. 

This planar convex set is shown in Figured In this example, since the curve is 
convex, the ordinary boundary coincides with the algebraic boundary, d a P = 
dP, and is represented by the defining quartic polynomial x 4 + y A — 1. 
The dual body is the unit ball for the L 4 / 3 -norm on IR 2 : 

P A = {( a , b)ER 2 : H 4 / 3 + \b\ 4/3 < 1} . 

The algebraic boundary of P A is an irreducible algebraic curve of degree 12, 

d a P A = v( a 12 +3a 8 6 4 +3a 4 6 8 +6 12 -3a 8 +21a 4 6 4 -36 8 +3a 4 +36 4 -l), (2.6) 

which again coincides precisely with the (geometric) boundary dP A . This 
dual polynomial is easily produced by the following one-line program in the 
computer algebra system Macaulay2 due to Grayson and Stillman 
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R = QQ[x,y,u,v]; eliminate ({x, y} , ideal (x~4+y~4-l ,x~3-u,y~3-v) ) 

In Subsection 2.4 we shall introduce the algebraic framework for performing 
such duality computations, not just for curves, but for arbitrary varieties. □ 

2.3 Lagrange Duality in Optimization 

We now come to a standard concept of duality in optimization theory. Let us 
consider the following general nonlinear polynomial optimization problem: 

minimize f(x) 

X 

subject to g%{x) < 0, % — 1, . . . , m, 

(2.7) 

hj(x) = 0, j = l,...,p. 

Here the g\, . . . , g m , hi, . . . , h p and / are polynomials in R[xi, . . . , x n ]. The 
Lagrangian associated to the optimization problem (12.71) is the function 

L : R n x R™ x W R n 

(x, A,/i) ^ f{x) + YT=l^i9ii x ) + Y?j=lN h A X ) 

The scalars \ G M+ and fij G R are the Lagrange multipliers for the con- 
straints gi(x) < and hj(x) = 0. The Lagrange function L(x,A,/i) can 
be interpreted as an augmented cost function with penalty terms for the 
constraints. For more information on the above formulation see [U §5.1]. 
One can show that problem (12.71) is equivalent to finding 

u* = minimize maximize L(x,X,li). 

x£M n /iGKPandA>0 

The key observation here is that any positive evaluation of one of the poly- 
nomials gi(x), or any non-zero evaluation of one of the polynomials hj(x), 
would render the inner optimization problem unbounded. 

The dual optimization problem to (12.71) is obtained by exchanging the 
order of the two nested optimization subproblems in the above formulation: 

v* = maximize minimize L(x,X,u). 

/^eIRPandA>0 x&R n 

S v ' 

4>(x,n) 

The function 0(A, /i) is known as the Lagrange dual function to our problem. 
This function is always concave, so the dual is always a convex optimization 
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problem. It follows from the definition of the dual function that 0(A, fi) < u* 
for all A, fi. Hence the optimal values satisfy the inequality 

v < u . 

If equality happens, v * = u*, then we say that strong duality holds. A neces- 
sary condition for strong duality is X*gi(x*) — for all i — 1, . . . , m, where 
x*, A* denote the primal and dual optimizer. We see this by inspecting the 
Lagrangian and taking into account the fact that hj(x) = for all feasible x. 

Collecting all inequality and equality constraints in the primal and dual 
optimization problems yields the following optimality conditions: 

Theorem 2.3 (Karush-Kuhn- Tucker (KKT) conditions). Let (x*,\*,fi*) be 
primal and dual optimal solutions with u* = v* (strong duality). Then 



V,/ + V A* • V x9i + V fi * ■ V x hj = 0, 

X* X* X* 

1=1 j=l 

9i{x*) < for % — 1, . . . , m, 

A* > for i — 1, . . . , m, 

h 3 (x*) = ioTj = l,...,p, (2.8) 

Complementary slackness: A* • gi(x*) = forz = 1, . . . ,m. 

For a derivation of this theorem see [H §5.5.2]. Several comments on the 
KKT conditions are in order. First, we note that complementary slackness 
amounts to a case distinction between active (g, = 0) and inactive inequalities 
(gi < 0). For any index i with gi(x*) ^ we need A, = 0, so the corresponding 
inequality does not play a role in the gradient condition. On the other hand, 
if gi(x*) = 0, then this can be treated as an equality constraint. 

From an algebraic point of view, it is natural to relax the inequalities and 
to focus on the KKT equations. These are the polynomial equations in (12. 8ft : 

hx(x) = ■■■ = h p (x) = Xmix) = \ m 9m{x) = 0. (2.9) 

If we wish to solve our optimization problem exactly then we must compute 
the algebraic variety in R n x R m x MP that is defined by these equations. 

In what follows we explore Lagrange duality and the KKT conditions in 
two special cases, namely in optimizing a linear function over an algebraic 
variety (Section [3D and in semidefinite programming (Section [5]). 
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2.4 Projective varieties and their duality 

In algebraic geometry, it is customary to work over an algebraically closed 
field, such as the complex numbers C. All our varieties will be defined over 
a subfield K of the real numbers R, and their points have coordinates in 
C. It is also customary to work in projective space P n rather than affine 
space C n , i.e., we work with equivalence classes x ~ Ax for all A £ C\{0}, 
x £ C n+1 \{0}. Points (xo : x\ : • • • : x n ) in projective space P n are lines 
through the origin in C n+1 , and the usual affine coordinates are obtained by 
dehomogenization e.g. with respect to xo (i.e. setting xo = 1). All points 
with Xo = are then considered as points at infinity. We refer to Chapter 
8] for an elementary introduction to projective algebraic geometry. 

Let / = (hi , . . . , hp) be a homogeneous ideal in the polynomial ring 
K[xo, X\, . . . , x n ]. We write X = V(I) for its variety in the projective space 
pn over q fj^g s i n g U i ar locus Sing(A) is a proper subvariety of A. It 
is defined inside X by the vanishing of the c x c-minors of the mx(n+l)- 
Jacobian matrix J(A) = (dhi/dxj), where c = codim(A). See [51 §9.6] for 
background on singularities and dimension. While the matrix J(X) depends 
on our choice of ideal generators h iy the singular locus of X is independent 
of that choice. Points in Sing(A) are called singular points of X. We write 
A reg = A\Sing(A) for the set of regular points in X. We say that the 
projective variety X is smooth if Sing(A) = 0, or equivalently, if X = A reg . 

The dual projective space (P n )* parametrizes hyperplanes in P™. A point 
(u : • • • : u n ) £ (P n )* represents the hyperplane {x £ P™ | J2™ =0 UiXi = 0}. 
We say that u is tangent to A at a regular point x £ A rcg if x lies in that 
hyperplane and its representing vector (ui, . . . , u n ) lies in the row space of 
the Jacobian matrix J(X) at the point x. 

We define the conormal variety CN(A) of A to be the closure of the set 

{(x, u) £ P n x (P n )* | x £ A rcg and u is tangent to A at x }. 

The projection of CN(A) onto the second factor is denoted A* and is called 
the dual variety. More precisely, the dual variety X* is the closure of the set 

{ u £ (P n )* | the hyperplane u is tangent to A at some regular point }. 

Proposition 2.4. The conormal variety CN(A) has dimension n — 1. 

Proof. We may assume that A is irreducible. Let c = codim(A). There are 
n—c degrees of freedom in picking a point x in A reg . Once the regular point 
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x is fixed, the possible tangent vectors u to X at x form a linear space of 
dimension c— 1. Hence the dimension of CN(X) is (n— c) + (c— 1) = n—1. □ 

Since the dual variety X* is a linear projection of the conormal variety 
CN(AT), Proposition 12.41 implies that the dimension of X* is at most is n — 
1. We typically expect X* to have dimension n — 1, i.e. regardless of the 
dimension of X, the dual variety X* is typically a hypersurface in (P n )*. 

Example 2.5 (Example 12.21 cont.). Fix coordinates (x:y:z) on P 2 and con- 
sider the ideal I = (x A + y 4 — z 4 ). Then X = V(I) is the projectivization of 
the quartic curve in Example 12.21 The dual curve X* is the projectivization 
of the curve d a P A in ( 12.60 . Hence, X* is a curve of degree 12 in (P 2 )*. □ 

To compute the dual X* of a given variety X, we can utilize Grobner 
bases [HI Ej as follows. We augment the ideal / with the bilinear polynomial 
Xir=o M * Xi an d au the (c + 1) x (c + l)-minors of the matrix obtained from 
Jac(X) by adding the extra row u. Let J' denote the resulting ideal in 
K[xq, ■ ■ ■ , x n , ito, ... , u n ]. In order to remove the singular locus of X from 
the variety of J', we replace J' with the saturation ideal 

J := ( J' : (c x c-minors of Jac(X) )°°). 

See [HI Exercise 8 in §4.4] for the definition of saturation of ideals. 

The ideal J is bi-homogeneous in x and u respectively. Its zero set in 
pn x (jpny j g conorma i variety CN(X). The ideal of the dual variety X* 
is finally obtained by eliminating the variables xq, . . . , x n from J: 

ideal of the dual variety X* = J n K[uq, u±, . . . , u n }. (2-10) 

As was remarked earlier, the expected dimension of X* is n — 1, so the 
elimination ideal (12.100 is expected to be principal. We seek to compute its 
generator. We shall see many examples of such dual hypersurfaces later on. 

Theorem 2.6. (Biduality, [3 Theorem 1.1]) 

Every irreducible projective variety IcP satisfies 

(x*y = x. 

Proof idea. The main step in proving this important theorem is that the 
conormal variety is self-dual, in the sense that CN(X) = CN(X*). In this 
identity the roles of x G P n and u G (P n )* are swapped. It implies (X*)* = X. 
A proof for the self-duality of the conormal variety is found in [TJ §1.1.3]. □ 
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Example 2.7. Suppose that X C P n is a general smooth hypersurface of 
degree d. Then X* is a hypersurface of degree d(d — l) n_1 in (P n )*. A 
concrete instance for d — 4 and n = 2 was seen in Examples 12.21 and 12.51 □ 

Example 2.8. Let X be the variety of symmetric m x m matrices of rank 
at most r. Then X* is the variety of symmetric m x m matrices of rank at 
most m — r §1.1.4]. Here the conormal variety CN(X) consists of pairs of 
symmetric matrices A and B such that A ■ B = 0. This conormal variety will 
be important for our discussion of duality in semidefinite programming. □ 

An important class of examples, arising from toric geometry, is featured in 
the book by Gel'fand, Kapranov and Zelevinsky 0. A projective toric variety 
Xa in P n is specified by an integer matrix A = (a , ai, . . . , a n ) of format 
d x (n+1) and rank d whose row space contains the vector (1,1,. ..,1). We 
define X A as the closure in P n of the set {(t a ° : t ai : • • • : t a ") 1 1 G (C\{0}) d }. 

The dual variety X\ is called the A- discriminant. It is usually a hyper- 
surface, in which case we identify the A- discriminant with the irreducible 
polynomial that vanishes on X\. The A- discriminant is indeed a dis- 
criminant in the sense that its vanishing characterizes Laurent polynomials 

n 
3=0 

with the property that the hypersurface {p(t) = 0} has a singular point in 
(C\{0})° ! . In other words, we can define (and compute) the A-discriminant as 



A A = jc G (P-)* | 3t G (C\{0}Y with p(t) = J| = • • • = J| = 

Example 2.9. Let d = 2, n = 4, and fix the matrix 

/4 3 2 1 
^0 1 2 3 4 

The associated toric variety is the rational normal curve 

X A = {(4: t\t 2 : t\t\ : t x t\ : t$) G P 4 | (ti : t 2 ) G P 1 } 

= V(xqX2— x\, X0X3— X1X2, XqX^—xI, XiX^—xl, X1X4—X2X3, X2X4— x\) . 
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A hyperplane { J^ =0 CjXj = 0} is tangent to Xa if and only if the binary form 

p(ti,t 2 ) = c 4 + citift + c 2 t\tl + c 3 tlt 2 + c4\ 
has a linear factor of multiplicity > 2. This is controlled by the A- discriminant 



/ c 


Cl 


c 2 


c 3 


C4 





o\ 





c 


Cl 




c 3 


c 4 











Co 


Cl 


c 2 


c 3 


c 4 


Cl 


2c 2 


3c 3 


4c 4 














Cl 


2c 2 


3c 3 


4c 4 














Cl 


2c 2 


3c 3 


4c 4 













Cl 


2c 2 


3c 3 


4cJ 



(2.11) 



given here in form of the determinant of a Sylvester matrix. The sextic 
hypersurface X\ = V(Aa) is the dual variety of the curve Xa- □ 



3 The Optimal Value Function 

In this section we examine the optimization problem (12. 7p under the hy- 
potheses that the cost function f(x) is linear and that there are no inequality 
constraints gi(x). The purposes of these restrictions is to simplify the pre- 
sentation and focus on the key ideas. Our analysis can be extended to the 
general problem (12.71) and we discuss this briefly at the end of this section. 

We consider the problem of optimizing a linear cost function over a com- 
pact real algebraic variety X in R n : 

Cq = minimize (c, x) 

(3.1) 

subject to x E X = {v E W l | h(v ) = ■ ■ • = h p (v) = 0} . 

Here hi, h 2 , ■ ■ ■ , h p are fixed polynomials in n unknowns X\, . . . ,x n . The 
expression (c, x) = C\X\ + • • • + c n x n is a linear form whose coefficients 
Ci,...,c n are unspecified parameters. Our aim is to compute the optimal 
value function Cq. Thus, we regard the optimal value Cq as a function M. n — > 
K. of the parameters c\, . . . ,c n , and we seek to determine this function. 

The hypothesis that X be compact has been included to ensure that 
the optimal value function Cq is well-defined on all of M n . Again, also this 
hypothesis can be relaxed. We assume compactness here just for convenience. 
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Our problem is equivalent to that of describing the dual convex body P A 
of the convex hull P = conv(X), assuming that the latter contains the origin 
in its interior. A small instance of this was seen in (11. 5p . Since our convex hull 
P is a semi-algebraic set, Tarski's theorem on quantifier elimination in real 
algebraic geometry [21 [3] ensures that the dual body P A is also semialgebraic. 
This implies that the optimal value function Cq is an algebraic function, i.e., 
there exists a polynomial $(co, ci, . . . , c n ) in n + 1 variables such that 

$(c*, Cl ,...,c n ) = 0. (3.2) 

Our aim is to compute such a polynomial $ of least possible degree. The 
input consists of the polynomials h±, . . . ,h p that cut out the variety X. The 
degree of $ in the unknown Co is called the algebraic degree of the optimization 
problem (12. 7p . This number is an intrinsic algebraic complexity measure for 
the problem of optimizing a linear function over X. For instance, if ci, . . . , c n 
are rational numbers then the algebraic degree indicates the degree of the field 
extension K over Q that contains the coordinates of the optimal solution. 

We illustrate our discussion by computing the optimal value function and 
its algebraic degree for the trigonometric space curve featured in [24"| §1]. 

Example 3.1. Let X be the curve in M 3 with parametric representation 

(xi, X2, X3) = (cos(#), sin(2#), cos(3#)) . 
In terms of equations, our curve can be written as X = V(h\, h 2 ), where 

hi = x\ — x\ — %ix-i and h 2 = £3 — 4x? + 3x\. 
The optimal value function for maximizing 01X1+02X2+03X3 over X is given by 

$ = (11664c|) • eg + (864c? cf, + 1512c?c|cJ - 19440c? cf 

+576ci c|c 3 - 1296ci cfc| + 64c!] - 25272cf cf - 34992c|j) • 4 
+ {1644 + 8c i c i c 3 - 1152c i c i - 1920^^ + 8208c|c| - 724c? c|c 3 + 144c? 44 
+44 ~ 17280c? 4 ~ 8044 ~ 2802c?c|c| - 3456c? 44 + 3888c? 4 ~ H20cic§ c 3 
+540ci 44 + 55080ci 44 ~ 128c l - 208c|cf +15417cf c|+15552c|cf +34992cf ) • eg 
+ (-16c? c| - 8cl4c 3 + 256c? c| - 44 + 328c f 44 ~ 1600c? cf + 114c? 0^3 
-2856c? 44 + 4608c? c| + 12c?c^ - 1959c?c|c| + 9192444 - 4320c?c| 
-528c? c|c 3 + 7644c? 44 ~ 7704c i 44 ~ 6912c? 4 ~ 48c?cl + 3592c?cfc§ 
-4863c? 44 ~ 13608c? 44 + 15552c? 4 + 800c i c l c 3 - 400c i 44 ~ 10350ci 44 
+16200ci 44 + 64c 2° + 8044 ~ 1^44 + 135c|dj + 9720c|c| - 116644°). 
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The optimal value function Cq is the algebraic function of ci, c 2 , c 3 obtained 
by solving $ = for the unknown Cq. Since Co has degree 6 in $, we see that 
the algebraic degree of this optimization problem is 6. Note that we can write 
Cq in terms of radicals in c±, c 2 , c 3 because there are no odd powers of c in $, 
which ensures that the Galois group of Cq over Q(c 1; c 2 , c 3 ) is solvable. □ 

We now come to the main result in this section. It will explain what 
the polynomial $ means and how it was computed in the previous example. 
For the sake of simplicity, we shall first assume that the given variety X is 
smooth, i.e. X = X rcg , where the set X rcg denotes all regular points on X. 

Theorem 3.2. Let X* C (P n )* be the dual variety to the projective closure of 
X. If X is irreducible, smooth and compact in W 1 then X* is an irreducible 
hypersurface, and its defining polynomial equals $(— co,cx, . . . ,c n ) where $ 
represents the optimal value function as in ( 13. 2 j) of the optimization problem 
( 13. ip . In particular, the algebraic degree of ( 13. ip is the degree in Co of the 
irreducible polynomial that vanishes on the dual hypersurface X* . 

Here the change of sign in the coordinate cq is needed because the equation 
Co = C\X\ + • • • + c n x n for the objective function value in M. n becomes the 
homogenized equation (— co)xq + C\X\ + • ■ ■ + c n x n = when we pass to P™. 

Proof. Since X is compact, for every cost vector c there exists an optimal 
solution x*. Our assumption that X is smooth ensures that x* is a regular 
point of X, and c lies in the span of the gradient vectors V a /ii| » for i = 
1, . . . , p. In other words, the KTT conditions are necessary at the point x*: 

p 

i=l 

hi(x*) =0 for i — 1, 2, . . . , p. 

The scalars AJ, . . . , A* express c as a vector in the orthogonal complement of 
the tangent space of X at x*. In other words, the affine hyperplane {x £ M n : 
(c, x) = Cq} contains the tangent space of X at x*. This means that the pair 
( x* , (— Cq : Ci : • ■ ■ : c n ) ) lies in the conormal variety CNpT) C P n x (P n )* 
of the projective closure of X. By projection onto the second factor, we see 
that (— Cq : ci : • • • : c n ) lies in the dual variety X*. 

Our argument shows that the boundary of the dual body P A is a sub- 
set of X*. Since that boundary is a semialgebraic set of dimension n — 1, 
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we conclude that X* is a hypersurface. If we write its defining equation as 
$(— c , Ci, . . . , c n ) = 0, then the polynomial $ satisfies (13. 2p . and the state- 
ment about the algebraic degree follows as well. □ 

The KKT condition for the optimization problem ( 13. 1} involves three sets 
of variables, two of which are dual variables, to be carefully distinguished: 

1. Primal variables x\, . . . ,x n to describe the set X of feasible solutions. 

2. (Lagrange) dual variables Ai, . . . , X p to parametrize the linear space of 
all hyperplanes that are tangent to X at a fixed point x*. 

3. (Projective) dual variables Cq, c\, . . . , c n for the space of all hyperplanes. 
These are coordinates for the dual variety X* and the dual body P A . 

We can compute the equation $ that defines the dual hypersurface X* 
by eliminating the first two groups of variables x = (xi, . . . ,x n ) and A = 
(Ai, . . . , A p ) from the following system of polynomial equations: 

c = (c, x) and h\{x) = ■ ■ • = h p (x) = and c = XiV x hi + • • • + \ P V x h p . 

Example 3.3. (Example 12. 21 cont.) We consider (13. ip with n — 2, p = 1 and 
hi = x\ + x\ — 1. The KKT equations for maximizing the function 

c = CiXi + c 2 x 2 (3.3) 

over the "TV screen" curve X = V(hi) are 

c 1 = X 1 -4x 3 1 , c 2 = A 1 -4xf, x\ + x\ = \. (3.4) 

We eliminate the three unknowns xi,X2, Ai from the system of four polyno- 
mial equations in (13. 3p and ( 13. 4p . The result is the polynomial $(— cq, c\, c 2 ) 
of degree 12 which expresses the optimal value Cq as an algebraic function of 
Ci and c 2 . We note that $(1, Cx, c 2 ) is precisely the polynomial in ( 12. 6p . □ 

It is natural to ask what happens with Theorem 13.21 when X fails to be 
smooth or compact, or if there are additional inequality constraints. Let us 
first consider the case when X is no longer smooth, but still compact. Now, 
X ieg is a proper (open, dense) subset of X. The optimal value function Cq for 
the problem ( 13. ip is still perfectly well-defined on all of R n , and it is still an 
algebraic function of ci, . . . , c n . However, the polynomial $ that represents 
Cq may now have more factors than just the equation of the dual variety X*. 
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Figure 4: The bicuspid curve in Example 13.41 



Example 3.4. Let n = 2 and p = 1 as in Example 13.31 but now we consider 
a singular quartic. The bicuspid curve, shown in Figure HI has the equation 

h = ( X 2_ 1 )( Xl _ 1 )2 + (x 2_ l) 2_ 

The algebraic degree of optimizing a linear function C\X\ + C2X2 over X = 
V(hi) equals 8. The optimal value function Cq = Cq(ci,C2) is represented by 

$ = (c - Ci + c 2 ) • (c - Ci - c 2 ) • (I6cg - 48( C 2 + c 2) c 4 + 

(24c?c| + 21cj + 64cf)c 2 + (54cic 2 t +32cf )c + 8c^-3c?c^+llc^) . 

The first two linear factors correspond to the singular points of the bicuspid 
curve X , and the larger factor of degree six represents the dual curve X*. □ 

This example shows that, when X has singularities, it does not suffice 
to just dualize the variety X but we must also dualize the singular locus of 
X. This process is recursive, and we must also consider the singular locus 
of the singular locus etc. We believe that, in order to characterize the value 
function $, it always suffices to dualize all irreducible varieties occurring in 
a Whitney stratification of X but this has not been worked out yet. In our 
view, this topic requires more research, both on the theoretical side and on 
the computational side. The following result is valid for any variety X in M. n . 
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Corollary 3.5. If the dual variety of X is a hypersurface then its defining 
polynomial contributes a factor to the value function of the problem 



This result can be extended to an arbitrary optimization problem of the 
form (12. 7p . We obtain a similar characterization of the optimal value Cg 
as a semi- algebraic function of c\, C2, ■ ■ ■ , c n by eliminating all primal vari- 
ables x\, . . . ,x n and all dual (optimization) variables x, A,/i from the KKT 
equations. Again, the optimal value function is represented by a unique 
square-free polynomial $(c , ci, . . . , c n ), and each factor of this polynomial is 
the dual hypersurface Y* of some variety Y that is obtained from X by set- 
ting gi(x) = for some of the inequality constraints, by recursively passing to 
singular loci. In Section we shall explore this for semidefinite programming. 

We close this section with a simple example involving A-discriminants. 

Example 3.6. Consider the calculus exercise of minimizing a polynomial 

q{t) = cit + c 2 t 2 + c 3 t 3 + c 4 t 4 
of degree four over the real line R. Equivalently, we wish to minimize 

C = CiXi + C 2 X 2 + C3X3 + C4X4 

over the rational normal curve Xa D {xq = 1} = V(x\ — £2, x\ — £3, x\ — £4), 
seen in Example 12.91 The optimal value function Cq is given by the equation 
A^(— Co, ci, C2, C3, C4) = 0, where is the discriminant in (12. lip . Hence the 
algebraic degree of this optimization problem is equal to three. □ 

4 An Algebraic View of Convex Hulls 

The problem of optimizing arbitrary linear functions over a given subset 
of M n , discussed in the previous section, leads naturally to the geometric 
question of how to represent the convex hull of that subset. In this section 
we explore this question from an algebraic perspective. To be precise, we 
shall study the algebraic boundary d a P of the convex hull P = conv(X) of 
a compact real algebraic variety X in MJ 1 . Biduality of projective varieties 
(Theorem 12. 6j) will play an important role in understanding the structure of 
d a P. The results to be presented are drawn from [2H 125] . In Section [6] we 
shall discuss the alternative representation of P as a spectrahedral shadow. 
We begin with the seemingly easy example of a plane quartic curve. 
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Figure 5: A quartic curve in the plane can have up to 28 real bitangents. 



Example 4.1. We consider the following smooth compact plane curve 
X = {(x,y) el 2 | 144x 4 +14V-225(x 2 +y 2 )+350xV+81 = 0}. (4.1) 

This curve is known as the Trott curve. It was first constructed by Michael 
Trott in [28] , and is illustrated above in Figure [5] A classical result of alge- 
braic geometry states that a general quartic curve in the complex projective 
plane P 2 has 28 bitangent lines, and the Trott curve X is an instance where all 
28 lines are real and have a coordinatization in terms of radicals over Q. Four 
of the 28 bitangents form edges of conv(X). These special bitangents are 

r , , . ^48050 + 434 v / 9889 

{(x,y) ER 2 \±x±y = 7}, where 7 = = 1.2177... 

248 

The boundary of conv(A) alternates between these four edges and pieces of 
the curve X. The eight transition points have the floating point coordinates 

( ± 0.37655..., ± 0.84122...) , ( ± 0.84122..., ± 0.37655...). 

These coordinates lie in the field Q(7) and we invite the reader to write them 
in the form q\ + ^27 where qi e Q. The Q-Zariski closure of the 4 edge lines 
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of conv(X) is a curve Y of degree 8. Its equation has two irreducible factors: 

(992x 4 -3968x 3 ?/+5952x 2 ?/ 2 -3968a;?/ 3 +992?/ 4 -1550a; 2 +3100x?/-1550?/ 2 +117) 
(992x 4 +3968x 3 y+5952x 2 y 2 +3968x?/ 3 +992y 4 -1550x 2 -3100x?/-1550?/ 2 +117) 

Each reduces over M to four parallel lines, two of which contribute to the 
boundary. The point of this example is to stress the role of the (arithmetic 
of) bitangents in any exact description of the convex hull of a plane curve. □ 

We now present a general formula for the algebraic boundary of the convex 
hull of a compact variety J in 1™. The key observation is that the algebraic 
boundary of P = conv(X) will consist of different types of components, 
resulting from planes that are simultaneously tangent at k different points 
of X, for various values of the integer k. For the Trott curve X in Example 
14.11 the relevant integers were k = 1 and k = 2, and we demonstrated that 
the algebraic boundary of its convex hull P is a reducible curve of degree 12: 



In the following definitions we regard X as a complex projective variety in P n . 

Let X^ be the variety in the dual projective space (P n )* which is the 
closure of the set of all hyperplanes that are tangent to X at k regular 
points which span a (k — l)-plane in P™. This definition makes sense for 
k — 1, 2, . . . , n. Note that X^ coincides with the dual variety X*, and X^ 
parametrizes all hyperplanes that are tangent to X at two distinct points. 
Typically, X® is an irreducible component of the sing ular locus of X* = X™. 
We have the following nested chain of projective varieties in the dual space: 



We now dualize each of the varieties in this chain. The resulting varieties 
(X^)* live in the primal projective space P n . For k = 1 we return to our 
original variety, i.e., we have (XW)* = X by biduality (Theorem ESD- In 
the following result we assume that X is smooth as a complex variety in P n , 
and we require one technical hypothesis concerning tangency of hyperplanes. 

Theorem 4.2. [221 Theorem 1.1] Let X be a smooth and compact real alge- 
braic variety that affinely spans M. n , and such that only finitely many hyper- 
planes are tangent to X at infinitely many points. The algebraic boundary 
d a P of its convex hull, P = conv(X), can be computed by biduality as follows: 



X U Y. 



(4.2) 



x [n] g x [n-l] C • • ■ C X [2] C X [1] C (P n )*. 



n 




(4.3) 
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Since d a P is pure of codimension one, in the union we only need indices k 
having property that (X*- k ')* is a hypersurface in P n . As argued in [23], this 
leads to the following lower bound on the relevant values to be considered: 

k >- rssfci- (44) 

The formula ( I4.3P computes the algebraic boundary d a P in the following 
sense. For each relevant k we check whether (X^)* is a hypersurface, and, 
if yes, we determine its irreducible components (over the field K of interest). 
For each component we then check, usually by means of numerical computa- 
tions, whether it meets the boundary OP in a regular point. The irreducible 
hypersurfaces which survive this test are precisely the components of d a X. 

Example 4.3. When X is a plane curve in M 2 then (14. 3[) says that 

d a P C XU(X [2] )*. (4.5) 

Here X^ is the set of points in (P 2 )* that are dual to the bitangent lines of 
X, and (X®)* is the union of those lines in P 2 . If we work over K = Q and 
the curve X is general enough then we expect equality to hold in ( 14.51) . For 
special curves the inclusion can be strict. This happens for the Trott curve 
(14. ip since Y is a proper subset of (X' 2 ')*. Namely, Y consists of two of the six 
Q-components of (X^)*. However, a small perturbation of the coefficients in 
(14. ip leads to a curve X with equality in (I4.5p . as the relevant Galois group 
acts transitively on the 28 points in X^ for general quartics X. Now, the 
algebraic boundary over Q is a reducible curve of degree 32 = 28 + 4. □ 

If we are given the variety X in terms of equations or in parametric form, 
then we can compute equations for X^ by an elimination process similar 
b of the dual variety X*. However, expressing the tangency condition at k 
different points requires a larger number of additional variables (which need 
to be eliminated afterwards) and thus the computations are quite involved. 
The subsequent step of dualizing X^ to get the right hand side of ( 14. 3ft is 
even more forbidding. The resulting hypersurfaces (X^)* tend to have high 
degree and their defining polynomials are very large when n > 3. 

The article [23] offers a detailed study of the case when X is a space curve 
inR 3 . Here the lower bound (g3D tells us that d a X C (X®)*U (X®)*. The 
surface (X^)* is the edge surface of the curve X, and (X®)* is the union of 
all tritangent planes of X. The following example illustrates these objects. 
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Figure 6: The convex hull of the curve (cos(#), cos(2#), sin(3#)) in IR 3 . 



Example 4.4. We consider the trigonometric curve X in M 3 which has the 
parametrization x = cos(#), y = cos(20), z = sin(3#). This is an algebraic 
curve of degree six. Its implicit representation equals X = V(hi, h 2 ) where 

h x = 2x 2 - y - 1 and h 2 = V + 2z 2 - 3y - 1. 

The edge surface (X^)* has three irreducible components. Two of the com- 
ponents are the quadric V(hi) and the cubic V(h 2 ). The third and most in- 
teresting component of (X^)* is the surface of degree 16 with equation /13 = 

- 419904a; 14 ?/ 2 + 664848a a V - 419904a; 10 ?/ + 132192a; 8 ?/ 8 - 20736a; 6 ?/ 10 + 1296a; 4 ?; 12 

- 46656a; 14 z 2 + 373248a; 12 ?/ 2 z 2 - 69984a; 10 ?/ 4 z 2 -22464a; 8 y 6 z 2 +4320a; 6 y 8 z 2 +31104a; 12 z 4 
+ 5184a; 10 y 2 z 4 + 4752a; 8 ?/ 4 z 4 + 1728a ao z 6 + 699840a; 14 y - 46656a; 12 ?/ 3 - 902016a; 10 ?/ 5 
+694656a; 8 y 7 -209088a;V-1150848a; 1 V2 2 +279936a; 8 ?/ 5 z 2 +17280a; 6 y 7 z 2 -4032a; 4 y 9 z 2 

- 98496a; 10 yz 4 + 27072a; 4 ?/ 11 - 1152a; 2 ?/ 13 - 419904a a V 2 - 25920a; 8 y 3 z 4 - 4608a; 6 ?/ 5 z 4 
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- 1728a 8 yz 6 - 291600a: 14 - 169128a; 12 y 2 - 256608a; 10 y 4 + 956880a 8 y 6 - 618192x 6 y 8 

+ 148824a 4 y 10 - 13120a 2 y 12 + 256y 14 + 392688a; 12 z 2 + 671976a; 10 y V + 1454976a 8 y 4 z 2 

- 292608a 6 y 6 z 2 - 4272a 4 y 8 z 2 + 1016xV z 2 -116208x 10 z 4 +135432a 8 y 2 z 4 +18144a;Vz 4 
+ 1264a 4 y 6 z 4 - 5616a; 8 z 6 + 504a; 6 y 2 z 6 - 1108080a 12 y + 925344a; 10 y 3 + 215136a; V 

- 672192a 6 y 7 + 331920a 4 y 9 - 54240a; 2 ?/ 11 + 2304y 13 +273456a; 10 yz 2 +282528a 8 y 3 z 2 

- 1185408a 6 y 5 z 2 + 149376a; 4 y 7 z 2 - 368a; 2 y 9 z 2 - 32y u z 2 +273456a;V 4 -67104a; 6 y 3 z 4 

- 4704a 4 y 5 z 4 - 64a; 2 j/z 4 + 4752a; 6 yz 6 - 32a 4 y 3 z 6 + 747225a; 12 + 636660a 10 y 2 

- 908010a 8 y 4 - 65340a 6 ?/ 6 + 291465a; 4 ?/ 8 - 101712a; 2 ?/ 10 + 8256y 12 - 818100a 10 z 2 

- 1405836a 8 y 2 z 2 - 905634a; 6 ?/ 4 z 2 + 583824a; 4 y 6 z 2 - 39318a; 2 y 8 z 2 + 368y 10 z 2 +193806a 8 z 4 

- 282996a 6 y 2 z 4 + 15450a 4 y 4 z 4 + 716a 2 y 6 z 4 + y 8 z 4 + 6876a; 6 z 6 - 1140a; 4 y 2 z 6 + 2x 2 y 4 z 6 
+ a 4 z 8 + 507384a 10 ?/ - 809568a 8 ?/ 3 + 569592a 6 ?/ 5 - 27216a 4 ?/ 7 - 71648a; 2 ?/ 9 + 13952y n 
+ 555768a 8 ?/z 2 + 869040a; 6 ?/ 3 z 2 + 688512a 4 y 5 z 2 - 154128a 2 y 7 z 2 +4416y 9 z 2 -343224a 6 yz 4 
+ 127360a 4 y 3 z 4 - 1656a 2 y 5 z 4 - 64y 7 z 4 - 4536a; 4 yz 6 +48a 2 y 3 z 6 -775170a; 10 -191808a 8 y 2 
+ 599022a 6 y 4 - 245700a 4 y 6 + 31608a; 2 ?/ 8 + 7872y 10 + 765072a; 8 z 2 + 589788a; 6 ?/ 2 z 2 

- 66066a 4 y 4 z 2 - 234252a 2 y 6 z 2 + 16632y 8 z 2 - 173196a 6 z 4 + 248928a 4 y 2 z 4 - 26158a 2 y 4 z 4 

- 32?/z 4 - 3904a; 4 z 6 + 804a; 2 y 2 z 6 + 2y 4 z 6 - 2x 2 z 8 + 5832a; 8 ?/ + 98280a; 6 ?/ 3 - 219456a; 4 ?/ 5 
+ 72072a 2 ?/ 7 - 8064y 9 - 724032a; 6 yz 2 - 515760a 4 y 3 z 2 - 99672a; 2 y 5 z 2 + 29976y 7 z 2 

+ 225048a 4 yz 4 - 76216a; 2 y 3 z 4 + 1912y 5 z 4 + 1696a; 2 yz 6 - 32y 3 z 6 + 411345a; 8 - 66096a 6 y 2 
-62532a; 4 y 4 +29388a; 2 y 6 -11856y 8 -365346a 6 z 2 +19812a 4 y 2 z 2 +104922a; 2 y 4 z 2 +24636y 6 z 2 
+ 85090a 4 z 4 -104580a; 2 y 2 z 4 +8282y 4 z 4 +1014a 2 z 6 -144y 2 z 6 + z 8 -39744a 6 y+61992a 4 y 3 
+ 2304a 2 y 5 + 576y 7 + 305328a 4 yz 2 + 86640a; 2 y 3 z 2 + 960y 5 z 2 - 73480a; 2 yz 4 + 16024y 3 z 4 

- 200yz 6 - 114966a; 6 + 24120a; 4 y 2 - 5958a; 2 y 4 + 6192y 6 + 85494a; 4 z 2 - 39696a 2 y 2 z 2 

- 11970y 4 z 2 - 21610a 2 z 4 + 16780y 2 z 4 - 94z 6 - 3672a 4 y - 11024a 2 y 3 + 272y 5 

- 46904a 2 yz 2 - 4632y 3 z 2 + 9368yz 4 + 15246a 4 - 84a 2 y 2 - 1908y 4 - 6892a 2 z 2 

+ 2204y 2 z 2 + 2215z 4 + 3216a; 2 y + 168y 3 + 904yz 2 - 664a; 2 + 292y 2 - 282z 2 - 96y + 9. 

The boundary of P = conv(X) contains patches from all three surfaces 
V(h\), V(h 2 ) and V{h^). There are also two triangles, with vertices at 
(V3/2, 1/2, ±1), (y/3/2, 1/2, ±1) and (0, -1, ±1). They span two of the tri- 
tangent planes of X, namely, z — 1 and z — — 1. The union of all tritangent 
planes equals (X^)*. Only one triangle is visible in Figure |6j It is colored 
in yellow. The curved blue patch adjacent to one of the edges of the tri- 
angle is given by the cubic hs, while the other two edges of the triangle lie 
in the degree 16 surface V(hs). The curve X has two singular points at 
(x,y,z) = (±1/2, —1/2,0). Around these two singular points, the boundary 
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is given by four alternating patches from the quadric V{h\) highlighted in 
red and the degree 16 surface V(h 3 ) in green. We conclude that the edge 
surface (X [2] )* = VQi^h^) is reducible of degree 21 = 2 + 3 + 16, and the 
algebraic boundary d a {P) is a reducible surface of degree 23 = 2 + 21. □ 

In our next example we examine the convex hull of space curves of degree 
four that are obtained as the intersection of two quadratic surfaces in IR 3 . 

Example 4.5. Let X = V(h±, h^) be the intersection of two quadratic sur- 
faces in 3-space. We assume that X has no singularities in P 3 . Then X is a 
curve of genus one. According to recent work of Scheiderer [27], the convex 
body P = conv(X) can be represented exactly using Lasserre relaxations, a 
topic we shall return to when discussing spectrahedral shadows in Section El 
If we are willing to work over M then P is in fact a spectrahedron, as shown in 
[24"t Example 2.3]. We here derive that representation for a concrete example. 
Lazard et al. [18j §8.2] examine the curve X cut out by the two quadrics 

h x = x 2 + y 2 + z 2 - 1 and h 2 = 19x 2 + 22y 2 + 21z 2 - 20. 

Figure [7J shows the two components of X on the unit sphere V(h\). 




Figure 7: The curve on the unit sphere discussed in Example 14.51 and 16.51 

The dual variety X* is a surface of degree 8 in (P 3 )*. The singular locus of 
X* contains the curve X^ which is the union of four quadratic curves. The 
duals of these four plane curves are the singular quadratic surfaces defined by 

h 3 = x 2 -2y 2 -z 2 , h A = 2x 2 -y 2 -l, h 5 = 3y 2 + 2z 2 -l, h 6 = 3x 2 + z 2 -2. 
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The edge surface of X is the union of these four quadrics: 

(X [2] y = V(h 3 ) U V(h 4 ) U V(h s ) U V(h 6 ). 

The algebraic boundary of P consists of the last two among these quadrics: 

d a P = V(h 5 ) U V(he). 

These two quadrics are convex. From this we derive a representation of P as 
a spectrahedron by applying Schur complements to the quadrics h 5 and he- 

fi+Vzy V2z o o \ 

y/2z l-y/3y 1 

V2+x t/3x ~ J ' 

\ v^x V2-xJ 



(x,y,z) e 



5 Spectrahedra and Semidefinite Programming 

Spectrahedra and semidefinite programming (SDP) have already surfaced a 
few times in our discussion. In this section we take a systematic look at these 
topics and their dualities. We write S n for the space of real symmetric nxn- 
matices and 5™ for the cone of positive semidefinite matrices in S n ~ 2 /. 
This cone is self-dual with respect to the inner product (U, V) = trace ({/■ V). 
A spectrahedron is the intersection of the cone with an affine subspace 

fC = C + Span(Ai, A 2 , . . . , A m ) . 



w 



Here W is a linear subspace of dimension m in S n , and the spectrahedron is 

m 

p = {xeric-^^A^o} ~ /cn5™. (5.i) 



i=i 



We shall assume that C is positive definite or, equivalently, that G int(P). 
The dual body to our spectrahedron is written in the coordinates on W m as 

P A = {yeW m | VxeP with (y,x) < 1} . 

We can express P A as a projection of the ) -dimensional spectrahedron 

Q = {UeS'l | (U,C) < 1}. (5.2) 
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Figure 8: The elliptope P = £3 and its dual convex body P A . 

Namely, consider the linear map dual to the inclusion of the linear subspace 
W = Span(y4 1; A 2 , . . . , An) in the ( n ^ 1 ) -dimensional real vector space S n : 

C/ ^ ((f/,A 1 ),(f/,A 2 ),...,(C/,A m )). 

Remark 5.1. The convex body P A dual to the spectrahedron P is affinely 
isomorphic to the closure of the image of the spectrahedron ( 15. 2 p under the 
linear map 7r>v, i.e. P A ~ ttw(Q). This result is due to Ramana and Gold- 
man [23]. In summary, while the dual to a spectrahedron is generally not a 
spectrahedron, it is always a spectrahedral shadow. See also Theorem 16.11 

Example 5.2. The elliptope S n is the spectrahedron consisting of all correla- 
tion matrices of size n, see [15]. These are the positive semidefinite symmetric 
n x n-matrices whose diagonal entries are 1. We consider the case n = 3: 





f 1 


X 






1 X 


1 






\y 


z 





(5.3) 



This spectrahedron of dimension m = 3 is shown on the left in Figure El The 
algebraic boundary of S3 is the cubic surface X defined by the vanishing of 
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the 3 x 3-determinant in (15. 3p . That surface has four isolated singular points 

X sing = {(1, 1, 1), (1, -1, -1), (-1, 1, -1), (-1, -1, 1)}. 

The six edges of the tetrahedron conv(X sing ) are edges of the elliptope 8 3 . 
The dual body, shown on the right of Figure El is the spectrahedral shadow 




3u,v G E : [a v c ^ > . (5.4) 



St = <(a,b,c)e 



The algebraic boundary of 8% can be computed by the following method. 
We form the ideal generated by the determinant in ( 15. 4 p and its derivatives 
with respect to u and v, and we eliminate u, v. This results in the polynomial 

(a 2 b 2 + b 2 c 2 + a 2 c 2 -2abc)(a + b + c+l)(a-b-c+l)(a-b + c-l)(a + b-c-l). 

The first factor is the equation of the Steiner quartic surface X* which is 
dual to Cayley cubic surface X = d a 8-s. The four linear factors represent the 
arrangement (X sing )* of the four planes dual to the four singular points. 
Thus the algebraic boundary of the dual body 8% is the reducible surface 

d a 8* = X* U (X sing )* C (P 3 )*. (5.5) 

We note that 8% is not a spectrahedron as it fails to be a basic semi- algebraic 
set, i.e. no polynomial satisfies 8% = {(a,b,c) G M 3 : (f>(a,b,c) > 0}. This 
is seen from the fact that the Steiner surface intersects the interior of 8^. □ 

Semidefinite programming (SDP) is the branch of convex optimization 
that is concerned with maximizing a linear function b over a spectrahedron: 

p* := Maximize (b, x) subject to x G P. (5-6) 

X 

Here P is as in (15. ip . As the semidefiniteness of a matrix is equivalent to the 
simultaneous non-negativity of its principal minors, SDP is an instance of 
the polynomial optimization problem ( 12. 7p . Lagrange duality theory applies 
here by [H §5]. We shall derive the optimization problem dual to ( I5.6P from 

d* := Minimize A subject to —b G P A . (5.7) 
A A 

Since we assumed G int(P), strong duality holds and we have p* = d*. 
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The fact that P A is a spectrahedral shadow implies that the dual opti- 
mization problem is again a semidefmite optimization problem. In light of 
Remark 15. 1[ the condition jb £ P A can be expressed as follows: 

3 U : U y , (C, U) < 1 and 6* = \(A h U) for i = 1,2, . . . ,m. 

Since the optimal value of (15. 7p is attained at the boundary of P A , we can 
here replace the condition (C, U) < 1 with (C, U) — 1. This is in fact what 
was done to obtain (15.41) . If we now set Y = \U, then (15.71) translates into 

d* := Minimize (C, Y) 

subject to (Ai, Y) = b { for % = 1, . . . , m ( 5 - 8 ) 

and Y h 0. 

We recall that W = Span(Ai, A<i, . . . , A m ) and we fix any matrix B £ S n 
with (Ai, B) = bi for % = 1, . . . , m. Then (15. 8p can be written as follows: 

d* := Minimize (C, Y) subject to Y £ (B + W ± ) n 5™ (5.9) 

The following reformulation of (15. 6p highlights the symmetry between the 
primal and dual formulations of our semidefmite programming problem: 

p* := Maximize (B, C - X) subject to X £ (C + W) H (5.10) 

Then the following variant of the Karush-Kuhn- Tucker Theorem holds: 

Theorem 5.3. [U §5.9.2] If both the primal problem Ii5.10\) and its dual Ii5.9\) 
are strictly feasible, then the KKT conditions take the following form: 

x £ (c + w)ns$ 
Y e(B + w L ) n S'l 

X ■ Y = (complementary slackness). 
These conditions characterize all the pairs (X, Y) of optimal solutions. 

This theorem can be related to the general optimality conditions (12.81) 
by regarding the entries of Y £ S n as the (Lagrangian) dual variables to 
the positive semidefinite constraint X = C — Y^iLi x iAi b 0. The three 
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conditions are both necessary and sufficient since semideflnite programming 
is a convex problem and every local optimum is also a global optimal solution. 

In order to study algebraic and geometric properties of SDP, we will relax 
the conic inequalities X, Y e <S™ and focus only on the KKT equations 

X eC + W , Y e B + W ± and X -Y = 0. (5.11) 

Given B, C and W, our problem is to solve the polynomial equations (15. lip . 
The theorem ensures that, among its solutions (X, Y), there is precisely one 
pair of positive semideflnite matrices. That pair is the one desired in SDP. 

Example 5.4. Consider the problem of minimizing a linear function Y t— > 
(C, Y) over the set of all correlation matrices Y, that is, over the elliptope E n 
of Example 15.21 Here m = n, B is the identity matrix, W is the space of all 
diagonal matrices, and W ± consists of matrices with zero diagonal. The dual 
problem is to maximize the trace of C — X over all matrices X G 5" such that 
C — X is diagonal. Equivalently, we seek to find the minimum trace t* of any 
positive semideflnite matrix that agrees with C in its off-diagonal entries. 
For n — 4, the KKT equations ( 15. lip can be written in the form 

/ Xi C12 C13 C M \ 

Y C 12 x 2 023 C24 

Cl3 c 23 ^3 c 34 

\Cl4 C24 C34 £4 J 

This is a system of 10 quadratic equations in 10 unknowns. For general 
values of the 6 parameters Cy, these equations have 14 solutions. Eight of 
these solutions have rank(X) = 3 and rank(F) = 1 and they are defined 
over Q(cjj). The other six solutions form an irreducible variety over Q(cjj) 
and they satisfy rank(X) = rank(y) = 2. This case distinction reflects the 
boundary structure of the dual body to the six-dimensional elliptope £4: 

<9 a £ 4 A = {rank(F) < 2}* U {rank(F) = 1}*. (5.13) 

Indeed, the boundary of 84 is the quartic hypersurface {rank(F) < 3}, its 
singular locus is the degree 10 threefold {rank(F) < 2}, and, finally, the 
singular locus of that threefold consists of eight matrices of rank one: 

(rank(F) = 1} = { (u u u 2 , u 3 , w 4 ) T ■ (u u u 2 , it 3 , u 4 ) : ^€{-1,+!}}. 



/ 1 2/12 2/13 Vu\ 

Vl2 1 2/23 2/24 

2/13 2/23 1 2/34 

V2/14 2/24 2/34 1 I 



0. (5.12) 
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The last two strata are dual to the hypersurfaces in (15.131) . The second com- 
ponent in ( I5.13P consists of eight hyperplanes, while the first component is 
irreducible of degree 18. The corresponding projective hypersurface is defined 
by an irreducible homogeneous polynomial of degree 18 in seven unknowns 
C12, C13, C14, c 2 3, c 2 4, c 34 , t* . That polynomial has degree 6 in the special un- 
known t*. Hence, the algebraic degree of our SDP is 6 when rank(F) = 2. □ 

In algebraic geometry, it is natural to regard the matrix pairs (X, Y) as 
points in the product of projective spaces P(iS n ) x F(S n )*. This has the 
advantage that solutions of ( 15. lip are invariant under scaling, i.e. whenever 
(X,Y) is a solution, then so is (XX,fjY) for any nonzero A, \i G R. In that 
setting, there are no worries about complications due to solutions at infinity. 

For the algebraic formulation we assume that, without loss of generality, 

6 1 = l,6 2 = 0,6 3 = 0,...,6 m = 0. 

This means that (Ai,X) = 1 plays the role of the homogenizing variable. Our 
SDP instance is specified by two linear subspaces of symmetric matrices: 

£ = Span(A 2 ,A 3 ,...,A m ) c W = Span(C,A 1 ,A 2 ,...,i4 m ) c S n . 

Note that we have the following identifications: 

RC + W = U and IRB + W 1 =RB + (£ x n^) = £ 1 . 

With the linear spaces C C U, we write the homogeneous KKT equations as 

X EU, Y e£ x andX-Y = 0. (5.14) 

Here is an abstract definition of semide finite programming that might appeal 
to some of our readers: Given any flag of linear subspaces C CW C S n with 
dim(W/£) = 2, locate the unique semidefinite point in the variety (15. 14ft . For 
instance, in Example 15.41 the space L consists of traceless diagonal matrices 
and U/C is spanned by the unit matrix B and one off-diagonal matrix C. 
We seek to solve the matrix equation X ■ Y = where the diagonal entries 
of X are constant and the off-diagonal entries of Y are proportional to C . 

The formulation (I5.14p suggests that we study the variety {XY = 0} for 
pairs of symmetric matrices X and Y. In [2"2~| Eqn. (3.9)] it was shown that 
this variety has the following decomposition into irreducible components: 

n-l 

{XY = 0} = |J{XF = 0} r C P(5 n ) x F(S n )*. 

r=l 
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Here {XY = 0} r denotes the subvariety consisting of pairs (X, Y) where 
rank(X) < r and rank(F) < n — r. This is irreducible because, by Exam- 
ple I2.8[ it is the conormal variety of the variety of symmetric matrices of 
rank < r. The KKT equations describe sections of these conormal varieties: 

{XY = 0} r n (P(W) x P(£ x ) ). (5.15) 

All solutions of a semidefmite optimization problem (and thus also the bound- 
ary of a spectrahedron and its dual) can be characterized by rank conditions. 
The main result in [22] describes the case when the section in (I5.15P is generic: 

Theorem 5.5. [2"2"l Theorem 7] For generic subspaces C C U C S n with 
dim(£) = m — 1 and dim(W) = m + 1, the variety (15.151) is empty unless 

( | < m and ( ) < ( ) — m. (5.16) 

\ 2 j- v 2 j - v 2 ; y } 

In that case, the variety f 1 5 . 1 5 j) is reduced, nonempty and zero- dimensional 
and at each point the rank of X and Y is n — r and r respectively ( strict 
complementarity). The cardinality of this variety depends only onm, n andr. 

The generic choice of subspaces £ C U corresponds to the assumption 
that our matrices Ai, A2, • • • , A m , B, C lie in a certain dense open subset in 
the space of all SDP instances. The inequalities (I5.16P are known as Pataki 's 
inequalities. If m and n are fixed then they give a lower bound and an 
upper bound for the possible ranks r of the optimal matrix of a generic 
SDP instance. The variety (I5.15P represents all complex solutions of the 
KTT equations for such a generic SDP instance. Its cardinality, denoted 
5(m,n,r), is known as the algebraic degree of semidefinite programming. 

Corollary 5.6. Consider the variety of symmetric nxn-matrices of rank < r 
that lie in the generic m- dimensional linear subspace P(W) ofF(S n ). Its dual 
variety is a hypersurface if and only if Pataki 's inequalities ( f 5. 1 6\) hold, and 
the degree of that hypersurface is S(m,n,r), the algebraic degree of SDP. 

Proof. The genericity of U ensures that {XY = 0} r n (P(W) x P(W)*) is the 
conormal variety of the given variety. We obtain its dual by projection onto 
the second factor P(Z/)* = P(5™/W" L ). The degree of the dual hypersurface 
is found by intersecting with a generic line. The line we take is P(£ _L /ZY" L ). 
That intersection corresponds to the second factor P(£ ± ) in (I5.15p . □ 
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We note that the symmetry in the equations (15.141) implies the duality 



5(m,n,r) = #(( ^ J — m,n,n — r), 

first shown in [22j Proposition 9]. See also [22| Table 2]. Bothmer and Ranes- 
tad jH] derived an explicit combinatorial formula for the algebraic degree of 
SDP. Their result implies that d~(m, n, r) is a polynomial of degree m in n 
when n — r is fixed. For example, in addition to j22j Theorem 11], we have 

6(6,n,n-2) = ^(lln 6 - 81n 5 + 185n 4 - 75n 3 - 196n 2 + 156n). 

The algebraic degree of SDP represents a universal upper bound on the 
intrinsic algebraic complexity of optimizing a linear function over any Tri- 
dimensional spectrahedron of nxn-matrices. The algebraic degree can be 
much smaller for families of instances involving special matrices Ai, B or C. 

Example 5.7. Fix n = 4 and m = 6 = dim(£ 4 ). Pataki's inequalities ( 15.1 6p 
state that the rank of the optimal matrix is r = 1 or r = 2, and this was 
indeed observed in Example 15.41 For r = 2 we had found the algebraic 
degree six when solving (I5.12p . However, here B is the identity matrix and 
Ai, A2, A3, A4 are diagonal. When these are replaced by generic symmetric 
matrices, then the algebraic degree jumps from six to 5(6,4,2) = 30. □ 

We now state a result that elucidates the decompositions in (I5.5P and (I5.13p . 

Theorem 5.8. // the matrices A\, . . . , A m and C in the definition Ii5. 1\) of 
the spectrahedron P are sufficiently generic, then the algebraic boundary of 
the dual body P A is the following union of dual hyper surf aces: 

d a P A C |J {X G C I rank(X) < r}* (5.17) 

r as m 

Proof. Let y be any irreducible component of d a P A C (P m )*. Then yndP A 
is a semi-algebraic subset of co dimension 1 in P A . We consider a general 
point in that set. The corresponding hyperplane H in the primal M m supports 
the spectrahedron P at a unique point Z. Then r = rank(Z) satisfies Pataki's 
inequalities, by Theorem 15.51 Moreover, the genericity in our choices of 
Ai, . . . , A m , C, H ensure that Z is a regular point in {X G C \ rank(X) < r}. 
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Bertini's Theorem ensures that this determinantal variety is irreducible and 
that its singular locus consists only of matrices of rank < r. This implies 
that {X £ £ | rank(X) < r} is the Zariski closure of {X £ P | rank(X) = r}, 
and hence also of a neighborhood of Z in that rank stratum. Likewise, y is 
the Zariski closure in (P m )* of y n dP A . An open dense subset of points in 
y H dP A corresponds to hyperplanes that support P at a rank r matrix. We 
conclude y* = {X £ C | rank(X) < r}. Biduality completes the proof. □ 

Theorem 15.81 is similar to Theorem 14.21 in that it characterizes the alge- 
braic boundary in terms of dual hypersurfaces. Just like in Section m we 
can apply this result to compute d a P A . For each rank r in the Pataki range 
f !5.16p . we need to check whether the corresponding dual hypersurface meets 
the boundary of P A . The indices r which survive this test determine d a P A . 

When the data that specify the spectrahedron P are not generic but 
special then the computation of d a P A is more subtle and we know of no 
formula as simple as (j5.17p . This issue certainly deserves further research. 

We close this section with an interesting 3-dimensional example. 

Example 5.9. The cyclohexatope is a spectrahedron with m = 3 and n = 5 
that arises in the study of chemical conformations [10] . Consider the following 
Schdnberg matrix for the pairwise distances \TDij among six carbon atoms: 

/ 2D 12 D 12 +D 13 -D 23 D 12 +D 14 -D 2 ± D 12 +D 15 -D 25 D 12 +D 16 -D 26 \ 

D 12 +D 13 -D 23 2D 13 D 13 +D 14 -D 34 D 13 +D 15 -D 35 D 13 +D 16 -D 36 

D 12 +D u -D 24 D 13 +D M -D U 2D 14 Du+D 15 -D 45 D 14 +D w -D 46 

D 12 +D 15 -D 25 D 13 +D 15 -D 35 D 14 +D 15 -D 45 2D 15 D 15 +D 5e -D 56 

\D 12 +D W -D 26 D 13 +D w -D 36 D 14 +D 16 -D 46 D lb +D m -D m 2D 16 

The Dij are the squared distances among six points in M 3 if and only if this 
matrix is positive-semidefinite of rank < 3. The points represent the carbon 
atoms in cyclohexane CqHi 2 if and only if -D^+i = 1 and D i<i+2 = 8/3 for all 
indices i, understood cyclically. The three diagonal distances are unknowns, 
so, for cyclohexane conformations, the above Schdnberg matrix equals 

/ 

C 6 (x,y,z) = 



2 


8/3 


x-5/3 


11/3 -y 


-2/3 \ 


8/3 


2 


5/3 + x 


8/3 


11/3-2 


x — 5/3 


5/3 + x 


16/3 


x + 5/3 


x — 5/3 


11/3-j/ 


8/3 


x + 5/3 


2y 


8/3 


-2/3 


11/3 -z 


x-5/3 


8/3 


16/3 J 
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The cyclohexatope Cyc 6 is the spectrahedron in IR 3 defined by C 6 (x, y, z) y 0. 
Its algebraic boundary decomposes as <9 a Cyc 6 = V(f) U V(g), where 

/ = 27 xyz — 75a; — 75y — 75z — 250 and 
g = 3xy + 3xz + 3yz- 22x-22y-22z+121. 

The conformation space of cyclohexane is the real algebraic variety 

{(x,y, z) G Cyc 6 | rank(C 6 (x, y, 2:)) < 3 } = V{f,g) U V(g) sing . 

The first component is the closed curve of all chair conformations. The sec- 
ond component is the boat conformation point (x, y, z) = (y , y , y) . These 
are well-known to chemists [TO]. Remarkably, the cyclohexatope coincides 
with the convex hull of these two components. This spectrahedron is an- 
other example of a convex hull of a space curve, now with an isolated point. 
Semidefinite programming over the cyclohexatope means computing the con- 
formation which minimizes a linear function in the squared distances Dij. □ 



6 Spectrahedral Shadows 

A spectrahedral shadow is the image of a spectrahedron under a linear map. 
The class of spectrahedral shadows is much larger than the class of spectra- 
hedra. In fact, it has even been conjectured that every convex basic semial- 
gebraic set in M. n is a spectrahedral shadow [13]. Our point of departure is 
the result, known to optimization experts, that the convex body dual to a 
spectrahedral shadow is again a spectrahedral shadow [TTJ Proposition 3.3]. 

Theorem 6.1. The class of spectrahedral shadows is closed under duality. 

Construction. A spectrahedral shadow can be written in the form 

{m p 
x e R m I 3 y e R p with C + x { A { + ^ yjBj h 
i=\ i=i 

An expression for the dual body P A is obtained by the following variant of 
the construction in Remark 15.11 We consider the same linear map as before: 

vr : S r ; -> R m , U m- ({Ax, to, ... , (An, CO). 
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We apply this linear map it to the spectrahedron 



Q = {U e SI | (C, U) < 1 and C/) 



(E p ,J7)=0}. 



The closure of the spectrahedral shadow n(Q) equals the dual convex body 
P A . This closure is itself a spectrahedral shadow, by [HI Corollary 3.4]. □ 

We now consider the following problem: Given a real variety X C M n , 
find a representation of its convex hull conv(X) as a spectrahedral shadow. 
A systematic approach to computing such representations was introduced by 
Lasserre [16], and further developed by Gouveia et al. [12]. It is based on the 
relaxation of non-negative polynomial functions on X as sums of squares in 
the coordinate ring K.[X]. This approach is known as moment relaxation, in 
light of the duality between positive polynomials and moments of measures. 

We shall begin by exploring these ideas for homogeneous polynomials 
of even degree 2c? that are non-negative on M. n . These form a cone in a real 
vector space of dimension N = • Inside that cone lies the smaller SOS 

cone of polynomials p that are sums of squares of polynomials of degree d: 



By Hilbert's Theorem [20], this inclusion of convex cones is strict unless 
(n, 2d) equals (l,2d) or (n, 2) or (3,4). The SOS cone is easily seen to be a 
spectrahedral shadow. Indeed, consider an unknown symmetric matrix Q e 
S N and write p = v T Qv where v is the vector of all TV monomials of degree 
d. The matrix Q is positive semidefinite if it has a Cholesky factorization 
Q = C T C. The resulting identity p = (Cv) T (Cv) can be rewritten as ( 16. ip . 
Hence the SOS cone is the image of under the linear map Q i— > v T Qv. 

Recent work of Nie [21] studies the boundaries of our two cones via com- 
putations with the discriminants we encountered at the end of Section 12.41 

Proposition 6.2 (Theorem 4.1 in [21]). The algebraic boundary of the cone 
of homogeneous polynomials p of degree 2d that are non-negative on M. n is 
given by the discriminant of a polynomial p with unknown coefficients. This 
discriminant is the irreducible hypersurface dual to the Veronese embedding 



P 



(6.1) 



,7V- 1 



( Xl : ■ ■ ■ : x n ) i-> (xf :xf l x 2 : • • • : x d n ) 



The degree of this discriminant is n(2d — 1) 
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rank 3 



Figure 9: The discriminant in Example 16 . 3 1 defines a curve in the (a, 6)-plane. 
The spectrahedral shadow C is the set of points where the ternary quartic f a ,b 
is SOS. The ranks of the corresponding SOS matrices Q are indicated. 

Proof. The discriminant of p vanishes if and only if there exists x G P n_1 
with p(x) = and Vp| =0. If p is in the boundary of the cone of positive 
polynomials then such a real point x exists. For the degree formula see [7] . □ 

Results similar to Proposition 16.21 hold when we restrict to polynomials p 
that lie in linear subspaces. This is why the A- discriminants A& from Sec- 
tion [23] are relevant. We show this for a 2-dimensional family of polynomials. 

Example 6.3. Consider the two-dimensional family of ternary quartics 

fa,b(x, y, z) = x 4 + y A + ax 3 z + ay 2 z 2 + by 3 z + bx 2 z 2 + (a + b)z A . 

Here a and b are parameters. Such a polynomial is non-negative on R 3 if and 
only if it is a sum of squares, by Hilbert's Theorem. This condition defines 
a closed convex region C in the (a, 6)-plane R 2 . It is non-empty because 
(0, 0) G C. Its boundary d a (C) is derived from the A- discriminant Aa, where 

(A 3 2 0\ 
A = 0402300 . 
\0 012124/ 
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This A-discriminant is an irreducible homogeneous polynomial of degree 24 
in the seven coefficients. What we are interested in here is the specialized dis- 
criminant which is obtained from by substituting the vector of coefficients 
(1, 1, a, a, b, b, a+b) corresponding to our polynomial f a ,b- The specialized dis- 
criminant is an inhomogeneous polynomial of degree 24 in the two unknowns 
a and b, and it is no longer irreducible. A computation reveals that it is the 
product of four irreducible factors whose degrees are 1, 5, 5 and 13. 
The linear factor equals a + b. The two factors of degree 5 are 

256a 2 -27a 5 +512a6+144a 3 6-27a 4 6+2566 2 -128a6 2 +144a 2 6 2 -1286 3 -4a 2 6 3 +166 4 , 
256a 2 -128a 3 +16a 4 +512a6-128a 2 6+2566 2 +144a 2 6 2 -4a 3 6 2 +144a6 3 -27a6 4 -276 5 . 

Finally, the factor of degree 13 in the specialized discriminant equals 

2916a n 6 2 + 19683a 9 6 4 + 19683a 8 fr 5 + 2916a 7 6 6 + 29 16a 6 6 7 + 19683a 5 6 8 
+19683a 4 6 9 + 2916a 2 £> n - 11664a 12 - 104976a 10 6 2 - 136080a 9 6 3 - 27216a 8 o 4 
-225504a 7 o 5 - 419904a 6 6 6 - 225504a 5 6 7 - 27216a 4 6 8 - 136080a 3 6 9 
-104976a 2 6 10 - 116646 12 + 93312a 11 + 217728a 10 & + 76032aV 
+1133568a 8 6 3 + 1976832a 7 6 4 + 891648a 6 6 5 + 891648a 5 6 6 + 1976832a 4 6 7 

+ 1 133568a 3 b 8 + 76032a 2 6 9 + 2 1 77 28a6 10 + 933126 11 - 241920a 10 
-1368576a 9 6 - 2674944a 8 fe 2 - 1511424a 7 6 3 - 4729600a 6 6 4 - 9369088a 5 6 5 
-4729600a 4 6 6 - 1511424a 3 6 7 - 2674944a 2 6 8 - 1368576a6 9 - 2419206 10 
+663552a 9 + 2949120a 8 6 + 10539008a 7 b 2 + 17727488a 6 6 3 + 9981952a 5 6 4 
+9981952a 4 6 5 + 17727488a 3 6 6 + 10539008a 2 & 7 + 2949120a6 8 + 6635526 9 
-2719744a 8 - 8847360a 7 6 - 14974976a 6 6 2 - 36503552a 5 6 3 - 56360960a 4 6 4 
-36503552a 3 b 5 - 14974976a 2 6 6 - 8847360a6 7 - 27197446 8 + 4587520a 7 
+25821184a 6 6 + 52035584a 5 b 2 + 50724864a 4 6 3 +50724864a 3 6 4 +52035584a 2 6 5 
+25821184a6 6 + 45875206 7 - 6291456a 6 - 31457280a 5 6 - 94371840a 4 6 2 
- 138412032a 3 b 3 - 94371840a 2 6 4 - 31457280a6 5 - 62914566 6 + 16777216a 5 
+50331648a 4 6 + 67108864a 3 6 2 + 67108864a 2 6 3 + 50331648a6 4 + 167772166 5 
-16777216a 4 - 67108864a 3 6 - 100663296a 2 b 2 - 67108864a6 3 - 167772166 4 . 

The relevant pieces of these four curves in the (a, 6)-plane are depicted in 
Figure [9j The line a + b = is seen in the lower left, the degree 13 curve 
is the swallowtail in the upper right, and the two quintic curves form the 
upper-left and lower-right boundary of the enclosed convex region C. 
For each (a, 6) G C, the ternary quartic f ay b has an SOS representation 

f a , b (x,y,z) = (x 2 ,xy,y 2 ,xz,yz,z 2 ) ■ Q ■ (x 2 , xy,y 2 , xz,yz, z 2 ) T ', 

where Q is a positive semidefmite 6x6-matrix. This identity gives 15 indepen- 
dent linear constraints which, together with Q >z 0, define an 8-dimensional 
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spectrahedron in the (21 + 2)-dimensional space of parameters (Q, a, b). The 
projection of this spectrahedron onto the (a, 6)-plane is our convex region C. 
This proves that C is a spectrahedral shadow. If (a, b) lies in the interior of 
C then the fiber of the projection is a 6-dimensional spectrahedron. If (a, b) 
lies in the boundary dC then the fiber consists of a single point. The ranks 
of these unique matrices are indicated in Figure M Notice that dC has three 
singular points, at which the rank drops from 5 to 4 and 3 respectively. □ 

We now shift towards a functional analytic point of view. The degree d 
is no longer fixed, and we consider all polynomials, not just homogeneous 
ones. Polynomials that are non-negative on ~R n form a convex cone C in the 
infinite-dimensional real vector space M[xx, ■ ■ ■ ,x n }. Its dual cone C* is the 
set of all linear functionals . . . ,x n ] — > R that are non-negative on C. 

We consider functionals that evaluate to 1 on the constant polynomial 1. 
These are represented by the moments of probability measures /i on 1": 



y a = x a d/j, for a G N™. 

Points in C* are moment sequences (y a ) G 1R N " of Borel measures /i on l n . 

This setup allows for an elegant and fruitful interpretation of Lagrange 
duality for polynomial optimization problems (12. 7p . To keep the exposition 
and notation simple, we restrict ourselves to the unconstrained problem 



minimize fix) = > f r 



x a 



Here we assume that / is bounded from below, say / > e, and deg(/) = 2d. 
Our problem is equivalent to finding the best possible lower bound: 

maximize t 

tm ( 62 ) 
subject to f(x) - 1 > for all x G M n . 

The Lagrange dual of the problem (16. 2p reads 

minimize / f(x)dfx, 
where V is the convex set of all Borel probability measures on IR n . We can 
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rewrite this now as an infinite-dimensional linear optimization problem: 



minimize 

y&y 



a 



(6.3) 



where 




x a djj, with \i G V 



The two dual problems (I6.2p and (16. 3p are as difficult to solve as our 
original optimization problem. There is a natural relaxation which is easier, 
and we can express this either on the primal side or on the dual side. In (16.21) 
we replace the constraint that f(x) — t be non-negative on M. n with the 
easier constraint that f(x) — t be a sum of squares. We relax the dual 
( 16. 3 p by enlarging the convex set y to the infinite- dimensional spectrahedron 
consisting of all positive semidefinite moment matrices 



These two relaxations are again related by Lagrange duality, but now they 
represent a dual pair of semidefinite programs. Of course, when we solve 
such an SDP in practise, we always restrict to a finite submatrix of M(y), 
usually that indexed by all monomials x a ^x^ of some bounded degree < d. 
The question of when such a relaxtion is exact and, if not, how large the gap 
can be, is an active area of research in convex algebraic geometry [121 El |27]. 

We now turn our attention to a variant of the above procedure which 
approximates the convex hull of a variety by a nested family of spectrahedral 
shadows. Let / be an ideal in W[x\, . . . , x n ] and Vr(J) its variety in M n . 
Consider the set of affine-linear polynomials that are non-negative on Vr(I): 



In light of the biduality theorem for convex sets (cf. Section 12. 2p . we can 
characterize the (closure of) the convex hull of our variety as follows: 



The geometry behind this formula is shown in Figure [TU1 

Following Gouveia et al. p2], we now replace the hard constraint that 
f(x) be non-negative on Vr(I) with the (hopefully easier) constraint that 
f(x) be a sum of squares in the coordinate ring ~R[x\, . . . , x n ]/I. Introducing 



M(y) = (y a+p ) 



y o. 



NN(J) 



{ / G M[xi, . . . , i n ]<i I f{x) > for all x G V R (I)}. 



conv(VR(/)) 



{x G R n | f(x) > for all / G NN(J)}. 
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Figure 10: Convex hull as intersection of half spaces. 



a parameter d that indicates the degree of the polynomials allowed in that 
SOS representation, we consider the following set of affine-linear polynomials: 

S0S d (7) = { / | / - q\ q 2 r e I for some & 6 R[x ly ...,x n ]< d }. 

The following chain of inclusions holds: 

SOSi(J) C SOS 2 (/) C SOS3CO C • • • C NN(J). (6.4) 

We now dualize the situation by considering the subsets of R n where the 
various / are non-negative. The d-th theta body of the ideal / is the set 

TH d (/) = {iel n | f(x) > for all / G SOS d (J)}. 

The following reverse chain of inclusions holds among subsets in R n : 

THx(I) 3TH 2 (/) DTH 3 (/) 2 ■•• 2 conv(V R (/)). (6.5) 

This chain of outer approximations can fail to converge in general, but there 
are various convergence results when the geometry is nice. For instance, if 
the real variety Vr(J) is compact then Schmiidgen's Positivstellensatz [271 
§3] ensures asymptotic convergence. When Vr(I) is a finite set, so that 
conv(Vji(/)) is a polytope, then we have finite convergence, that is, 3d : 
THrf(J) = conv(^(/)). This was shown in [T3]. For more information on 
theta bodies see [12] . The main point we wish to record here is the following: 
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Theorem 6.4. ([T2J H7J) Each theta body TH^(J) is a spectrahedral shadow. 

Proof. We may assume, without loss of generality, that the origin lies in 
the interior of conv(VR(/)). Then SOSd(J) is the cone over the convex set 
dual to TH^(J). Since the class of spectrahedral shadows is closed under 
duality, and under intersecting with affine hyperplanes, it suffices to show 
that SOSd(J) is a spectrahedral shadow. But this follows from the formula 
/ — q\ — ■ ■ ■ — q 2 G /, by an argument similar to that given after (16. ip . □ 

In this article we have seen two rather different representations of the 
convex hull of a real variety, namely, the characterization of the algebraic 
boundary in Section HI and the representation as a theta body suggested 
above. The relationship between these two is not yet well understood. A 
specific question is how to best compute the algebraic boundary of a spec- 
trahedral shadow. This leads to problems in elimination theory that seem to 
be particularly challenging for current computer algebra systems. 

We conclude by revisiting one of the examples we had seen in Section HI 

Example 6.5. ( Example 14 . 5 1 cont . ) We revisit the curve X = V(hi, h 2 ) with 

hi = x 2 + y 2 + z 2 - 1, 

h 2 = 19x 2 + 21y 2 + 22z 2 - 20. 

Scheiderer [27] proved that finite convergence holds in (16. 5p whenever I de- 
fines a curve of genus 1, such as X. We will show that d = 1 suffices in our 
example, i.e. we will show that THi(J) = conv(AT) for the ideal I = (hi, h 2 )- 
We are interested in afnne-linear forms / that admit a representation 

/ = 1 + ux + v y + wz = Hihi + fi 2 h 2 + q 2 . (6.6) 

i 

Here /ii and fi 2 are real parameters. Moreover, we want / to lie in SOSi(J), 
so we require degg» = 1 for all i. The sum of squares can be written as 

^2<li = Q,x,y,z) ■ Q • (l,x,y,z) T , where Q G 

i 

After matching coefficients in (16.61) . we obtain the spectrahedral shadow 
SOSi(J) = {(u,v,w) G 1R 3 | 3/i!,/i 2 : 

/l + [l\ + 20/i2 u v w \ 

u —fii — 19//2 . 

V -fJL! - 21^i 2 - 

y w -y,! - 22^ 2 J 
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Figure 11: Convex hull of the curve in Figure [7] and its dual convex body. 



Dual to this is the theta body THi(J) = SOSi(/) A . It has the representation 



THi(J) = {(x,y,z) G M 3 | 3ui,m 2 ,«3,M4 : 



(I 


X 


y 


z\ 


X 


2 1 

3 3^ 4 


Ui 




y 


Ml 


1 2 

3 3^4 


U 3 


V 


"2 


«3 





to}. 



We consider the ideal generated by this 4x4-determinant and its derivatives 
with respect to U\, u 2 , «3, «4, we saturate by the ideal of 3x3- minors, and then 
we eliminate Ui,u 2 , 1*3,1*4. The result is the principal ideal {fi4h 5 h 6 ), with hi 
as in Example 14.51 This computation reveals that the algebraic boundary of 
conv(X) consists of quadrics, and we can conclude that TH^J) = conv(X). 

Pictures of our convex body and its dual are shown in Figure [TTJ Dia- 
grams such as these can be drawn fairly easily for any spectrahedral shadow in 
IR 3 . To be precise, the matrix representation of THi(J) and SOSi(/) A given 
above can be used to rapidly sample the boundaries of these convex bodies, 
by maximizing many linear functions via semidefinite programming. □ 
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